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SuperCollider &i supernova 

Challenges of Parallel Audio Synthesis 

SuperCollider 




► sclang, a real-time scripting language, with a huge class 
library 

► scsynth, an audio synthesis engine 

► a huge number of unit generators, provided as plugins 

► a gui system (with 2 implementations) 

► several IDEs 
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SuperCollider &i supernova 

Challenges of Parallel Audio Synthesis 

supernova 




► a multi-processor aware replacement for scsynth 

► exposes parallelism to the user of the language 

► designed for low-latency real-time applications 
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Parallelism 


► channels 

► voices, polyphonic music 

► tracks/buses 

► additive synthesis 
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Signal Gra 

ph Parallelism 



► What is the size of the synthesis graph? 

► How big are the CPU expenses of the nodes? 

► Graph nodes can be combined to avoid scheduling overhead. 


Tim Blechmann tim@klingt.org 


supernova - A Multiprocessor Aware Real-Time Audio Synthesis 




Introduction 
SuperCollider Node Graph 
Architecture of supernova 
Performance Results 
Summary 

Node Dependencies 


► There may be implicit dependencies based on resource access! 

► A dependency analysis may not be trivial 

► Automatic dependency analysis may be difficult or impractical 
to implement 


SuperCollider &i supernova 

Challenges of Parallel Audio Synthesis 
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SuperCollider & supernova 

Challenges of Parallel Audio Synthesis 

Real-Time Node Scheduling 



► How can the node graph be traversed in a real-time context? 

► Deadlines are almost one milliseconds (64 samples)! 

► Scheduling latency may be hundreds of microseconds (unless 
we are on highly tuned hardware with RT preemption 
patches). 

► Avoid locks! 
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SuperCollider Node Graph 

Parallel Groups 


SuperCollider Node Graph 


► SuperCollider uses groups for structuring the audio synthesis 

► Groups are linked lists of nodes 

► Nodes can be groups or synths. 
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Groups - 

Features 



► Form a tree hierarchy with a group as root 

► Syntactic sugar for organizing the audio synthesis 

► Multiple nodes can be addressed as one entity 

► Expose the order of execution explicitly to the user 
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Groups - 

An Example 



var gen_group, fx; 
gen_group = Group.new; 

4. do { 

Synth.head(gen_group, \myGenerator) 

>; 

fx = Synth.after(gen_group, \myFx); 
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Groups - An Example (2) 


Root Group 
Group: gen_group 
Synth: myFx 
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Child Group 
Synth: myGenerator 
Synth: myGenerator 
Synth: myGenerator 
Synth: myGenerator 
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Parallel Groups 



How to introduce parallelism to concept of the SuperCollider node 
graph? 
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Parallel Groups 



How to introduce parallelism to concept of the SuperCollider node 
graph? 


Parallel Groups 
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Parallel Groups 



► Semantics similar to groups 

► Nodes can be executed in parallel 

► Integrates well into the SuperCollider node graph: 

One more class, one more OSC command for the server 

► Backward compatible: Old code still keeps its semantics 

► Forward compatible: Parallel groups can easily be emulated 
with groups 
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Parallel Groups - An Example 



var gen_group, fx; 
gen_group = PGroup.new; 

4. do { 

Synth.head(gen_group, \myGenerator) 

>; 

fx = Synth.after(gen_group, \myFx); 
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Parallel Groups - An Example 

(2) 
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Features and Issues 
SuperCollider Unit Generators 


Architecture of supernova: Features 


► Focuses on latency instead on throughput 

► No pipelining, so no additional latency 

► Mostly lock-free synchronization (boost.lockfree) 

► SC node graph needs to be transferred into a dependency 
graph representation. 
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Features and Issues 
SuperCollider Unit Generators 


Architecture of supernova: Issues 


► Idle threads perform busy waiting (produces heat, takes 
resources) 

► No automatic dependency checking 

► Some use cases may be difficult to formulate to get the best 
performance 

► ... and no non-rt synthesis (yet) 
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Features and Issues 

SuperCollider Unit Generators 


SuperCollider Unit Generators 


► Supernova can load SuperCollider Unit Generators 

► Unit generators need to be adapted to ensure data consistency 

► All unit generators from the SuperCollider distribution have 
been ported to supernova 

► Some unit generators from the sc3-plugins as well 
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Features and Issues 

SuperCollider Unit Generators 


Resource Consistency 


► Reader-writer spinlocks are used to ensure data consistency 

► Reading the same resource from parallel synths is safe 

► Writing to the same resource from parallel synths may be safe 
Out. ar is safe 

ReplaceOut. ar is not 

► Writing to the same resource from parallel synths increases 
contention! 
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SuperCollider Unit Generators 

SuperCollider Unit Generators 

An Example 

void In_next_a(IOUnit *unit, : 
{ 

[. . .] 

int inNumSamples) 

for (int i=0; i<numChannels; ++i, 
in += bufLength) { 

if (touched[i] == bufCounter) 

Copy(inNumSamples 

else 

, OUT(i), in); 

Fill(inNumSamples 

> 

> 

, OUT(i), O.f); 
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Features and Issues 

SuperCollider Unit Generators 


SuperCollider Unit Generators: An Example 


void In_next_a(IOUnit *unit, int inNumSamples) 

{ 

[. . .] 

for (int i=0; i<numChannels; ++i, 
in += bufLength) { 

int32 busChannel = (int32)fbusChannel + i 
ACQUIRE_BUS_AUDIO_SHARED(busChannel); 
if (touched[i] == bufCounter) 

Copy(inNumSamples, OUT(i), in); 

else 

Fill(inNumSamples, OUT(i), O.f); 
RELEASE_BUS_AUDIO_SHARED(busChannel); 


> 
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Throughput Benchmarks 

Latency Benchmarks 


1024 Lightweight Synths 


CPU Utilization in Percent 



Tim Blechmann tim@klingt.org 


supernova - A Multiprocessor Aware Real-Time Audio Synthesis 





















Introduction 
SuperCollider Node Graph 
Architecture of supernova 
Performance Results 
Summary 


Throughput Benchmarks 

Latency Benchmarks 


30 Heavyweight Synths 


CPU Utilization in Percent 



Tim Blechmann tim@klingt.org 


supernova - A Multiprocessor Aware Real-Time Audio Synthesis 





















Introduction 
SuperCollider Node Graph 
Architecture of supernova 
Performance Results 
Summary 


Throughput Benchmarks 

Latency Benchmarks 


128 Synths with High Resource Contention 


CPU Utilization in Percent 
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Throughput Benchmarks 

Latency Benchmarks 


128 Synths with Low Resource Contention 


CPU Utilization in Percent 


70 - 

60 =- - - 

50 - 
40 

30 - - - 

20 - _ 
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scsynth groups pgroups -T 1 pgroups -T 2 pgroups -T 3 pgroups -T 4 
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Throughput Benchmarks 

Latency Benchmarks 


Speedup Overview 



Small Synths 

Large Synths 

High Contention 

Low Contention 

scsynth 

1 

0.95 

1 

0.98 

sequential groups 

1 

1 

1 

1 

parallel groups, 1 thread 

0.98 

1 

1 

1 

parallel groups, 2 threads 

1.72 

2 

1.87 

1.94 

parallel groups, 3 threads 

2.33 

2.73 

2.61 

2.73 

parallel groups, 4 threads 

2.88 

3.75 

3.16 

3.33 


Tim Blechmann tim@klingt.org 


supernova - A Multiprocessor Aware Real-Time Audio Synthesis 







Introduction 
SuperCollider Node Graph 
Architecture of supernova 
Performance Results 
Summary 


Throughput Benchmarks 
Latency Benchmarks 


Node Graph Parsing, Group 


DSP Queue Creation Times for Sequential Group with 512 Synths 



us 
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Throughput Benchmarks 
Latency Benchmarks 


Node Graph Parsing, Parallel Group 


DSP Queue Creation Times for Parallel Group with 512 Synths 



us 
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Summary 


► supernova is a drop-in replacement for scsynth 

► supernova extends the SuperCollider node graph by a simple 
concept 

► not a single crash during a concert 
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Conclusion 


Tha 

nks! 


Questions? 
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